Random phase vector for calculating the trace of a large matrix 
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We derive an estimate of statistical error in calculating the trace of a large matrix by using random 
vector, and show that random phase vector gives the results with the smallest statistical error for 
a given basis set. This result supports use of random phase vectors in the calculation of density of 
states and linear response functions of large quantum systems. 
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In fast algorithm called Order-N methods for calculat- 
ing DOS and linear response functions 0, 0, 0, 0, IE IE S 
S la EE m III EE 13 J 5., 16] , Monte Carlo calculation 
of the trace of large matrices by using random vector of- 
ten plays an important role. The central limit theorem 
guarantees the convergence of the result as the sample 
number K increases. Further experience shows another 
useful feature called self-averaging effect: the fluctuation 
in some physical quantities such as energy density and 
linear response functions decreases as the dimension N 
of the Hilbert space increases ,9^ ,10. 11. 12, 13, 14. 15. 
These two types of convergence make the random vector 
a very efficient numerical tool. 

A special class of random vector called random, phase 
vector 0, 0, IE| has been used in later papers 0, 0, 0, 
IT^ IT^j l without examining its efficiency rigorously. In 
this article, we prove , by following the scheme of [lol |. 
that random phase vector is really the most efficient ran- 
dom vector in a wide class of random vectors. Then we 
illustrate the mechanism of self-averaging with a simple 
model Hamiltonian. 

A (complex) random vector is defined by 
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where {\n)} is the basis set used in the computation and 
^„ are a set of complex random variables with identical 
probability distribution satisfying the statistical relations 
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where (( • )) stands for the statistical average. This class 
of random vectors defined in one orthonormal basis set 
have the coefficients Ci in another orthonormal basis set 
\l) (for example, energy eigenstates \Ei) of a Hamiltonian 



* Elec tronic address: 

URL: jhttp : //vmi . litaka . orgT] 



|tiitaka@riken.jpl 



N 



ii>)-Eioc/ 



(5) 



1=1 



that satisfy the same statistical relations as ©-(Ell: Since 
the coefficients in the two basis sets are related by unitary 
transformation 
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the statistical relations of are derived as 
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In view of energy eigenstates, uniform random vector 
contains all eigenstates with equal probability and rep- 
resents the system at a very high temperature. The or- 
thonormality and completeness of random vectors 
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where I is the identity operator, are shown by using (0)- 

The most important feature of random vectors is that 
statistical average of (<I>|X|<I>) gives the trace of X as 



2 



follows: 

(( (mm )) 

= ^X„„ =tr[X]. 

n 

For numerical evaluation of (|13() with ii' samples of ran- 
dom vectors, the second term of gives the statistical 
fluctuation, 

= i E E (Ci'ki'J - <5n,„.) X„,n.. (14) 
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In the special case of Hermitian matrix X„j„2 — -'^n2ni' 
it is shown that dX becomes real number by adding the 
expression H14f) with the subscripts ni and n2 exchanged. 
The order of the fluctuation for a general matrix X is 
estimated as 
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By taking statistical average of (|15|) and carefully eval- 
uating {{C[''hn2^n3 \n[^ ^)) by usiug ^-^,we obtain 
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(16) 

where the factor — l) is factored out because 

we assumed identical probability distribution for all 
The factor 1/K ensures the behavior \5X\ ^ ex- 
pected from the central limit theorem. According to the 
inequality 
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the fluctuation becomes the smallest for a given basis set 
if and only if |^n| = 1 for each n and for each sample of 
random variables. One of such random vectors is called 
random phase vector 0, 0, and defined by 
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where 0„ are a set of independent uniform random vari- 
ables defined in [— tt, tt]. Obviously, the random variables 
^„ = e*^" satisfies the statistical relations (jSJ-Q- Fur- 
ther, each sample of random phase vectors is automati- 
cally normalized without statistical fluctuation, 

{.^random phase\^ random phase) 
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This result is consistent with the observation in ^3 that 
normalized random vectors give less statistical error than 
unnormalized random vectors. It is important to notice 
that the definition of random phase vector, |^„| = 1, de- 
pends on the choice of basis set. For example, let us ex- 
amine a unitary transformation of a random phase vector 
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The transformed random vector C„ of course satisfies the 
relation ©-HI, but |Cn| = l±cos(6ii -6*2) ^ 1. Therefore 
the random phase vector in the original basis set is not 
a random phase vector in the new basis set. For random 
phase vectors in a given basis set, the fiuctuation pt)|) 
reduces to a simple form 8] , 
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which becomes zero for diagonal matrices as expected. 
Note that the fluctuation H22I) depends on the choice of 
basis set and that it is very important for reducing fluctu- 
ation to choose a basis set that makes off-diagonal matrix 
elements smallest. For a Hermitian matrix X, in theory, 
we can choose a basis set that diagonalizes X and remove 
the fluctuation completely. 

As a complex random vector a real random vec- 
tor can be defined by using real random variables with 
identical probability distribution satisfying the statistical 
relations 
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Unlike a complex random vector, a real random vector 
in one basis set is not necessarily mapped by unitary 
formation (|SJ) to a real random vector in another basis 
set. However, if we stick to the original basis set, the 
equations (|lll) - (|15|l are also valid for real random vectors. 
Before evaluating \6X\'^ for real random vectors, let us 
assume that X is symmetric matrix for simplicity. This 
does not limit the generality of our argument because any 
matrix X can be decomposed into the sum of symmetric 
and antisymmetric parts. 



X^l{X + X') + ^iX-X') 
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where X* represents the transpose of X, and tr[X] = 
tr[5'] since tr[v4] — 0. Therefore if X is not symmetric 
then we may calculate the trace of the symmetric part S 
in place of X. By taking statistical average of (|15|) and 

carefully evaluating ( (Ci? C"? ^"3 ^ ^"4 ^ ) ) by using 
(PH), we obtain 
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As in the case of complex random vectors, the real ran- 
dom vectors with minimum fluctuation are random vec- 
tors with ^„ = ±1, [random sign vector). Random sign 
vector may be regarded as a random phase vector with 
binary phase 0„ = 0, tt and satisfies the normalization 
(|19|l without fluctuation. For random sign vectors in a 
given basis set, the fluctuation H2t)|) reduces to a simple 
form 
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which is twice of the fluctuation of random phase vec- 
tors (|22|l . Therefore use of random phase vector rather 
than random sign vector is recommended except in spe- 
cial cases where the evaluation of matrix elements is sub- 
stantially accelerated by using real numbers instead of 
complex numbers. 

Let us illustrate efficiency of various types of random 
vectors in the case of the Hamiltonian operator for a par- 
ticle moving in one-dimensional space under the influence 
of local potential V{x), 
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Discretizing the one-dimensional space of size L — NAx 
into N meshes Xi = iAx, [i — 1, - ■ ■ , N) by finite differ- 
ence method gives the matrix of the Hamiltonian, which 
is tridiagonal. 
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(5ij_i) + SijV{xi). 



(29) 

The fluctuation ^ for random phase vector, complex 
random Gaussian vector, random sign vector, and real 
Gaussian vector is respectively estimated as 2, 6, 4, and 

12 in the unit of ' ^ 
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by using the fact that 



(dCnl )) — li2,l, and 3 respectively and the equations 
(fTB|l and The fluctuation of random phase vector 



is 6 times less than that of real Gaussian random vector, 
which means that 6 times less sampling is necessary for 
a given accuracy. 

In contrast to the random vector dependence of fluc- 
tuation, it is rather difficult to discuss the self-averaging 
effect in a general way because the effect depends on the 
choice of basis set and also on the physical nature of the 
matrix X such as whether the quantity is intensive or 
extensive. Therefore let us first examine self-averaging 
effect of energy density in the above example. The fluc- 
tuation for random phase vector becomes 
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as iV ^ oo, indicating self-averaging behavior 1/y'N. It 
is interesting that the local potential V{x) does not con- 
tribute to the fluctuation at all in the calculation with 
the real space basis set. When the spectral density such 
as DOS and linear response function, the dimension N 
of matrix in the above estimation should be replaced by 
Nef f , the number of resonances with in the spectral reso- 
lution 77, e.g. Neff = p(^)fy in case of DOS. Therefore to 
reach the same accuracy we need more random vectors 
for higher energy resolution or low temperatures. See 
Ref. [i3 for more sophisticated analysis. To understand 
general tendency of self-averaging, let us assume typical 
matrix elements of X have value of 0(1). Then the fluc- 
tuation |<5Xp in Hlt)|) and (|26|l becomes 0{N) for sparse 
or banded matrices and 0{N'^) for dense matrices. Since 
the average value, tr[X], becomes 0{N), the relative fluc- 
tuation SX/tT[X] becomes Oil/^/N) and 0(1), respec- 
tively. Therefore self-averaging is effective for sparse or 
banded matrices but not for dense matrices. 

In summary, we have proved that random phase vector 
is the most efficient choice among random vectors with 
identical probability distributions satisfying (El-Q. The 
fiuctuation for random phase vector is expressed as the 
sum of the square norm of off-diagonal elements. We 
show also that the smallness and sparseness of the off- 
diagonal elements is important for the self-averaging ef- 
fect. Therefore we conclude that it is important for ef- 
ficient calculation to use random phase vector and to 
choose a basis set that makes the off-diagonal elements 
sparse (or banded) and small. 
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